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with nonstationary regressors 
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Accidentia Sinica and Xiamen University 

Abstract: In this article asymptotic expressions for the final prediction er- 
ror (FPE) and the accumulated prediction error (APE) of the least squares 
predictor are obtained in regression models with nonstationary regressors. It 
is shown that the term of order 1/n in FPE and the term of order logn in 
APE share the same constant, where n is the sample size. Since the model 
includes the random walk model as a special case, these asymptotic expres- 
sions extend some of the results in Wei (1987) and Ing (2001). In addition, 
we also show that while the FPE of the least squares predictor is not affected 
by the contemporary correlation between the innovations in input and output 
variables, the mean squared error of the least squares estimate does vary with 
this correlation. 



1. Introduction 

Consider a simple regression model 

(1.1) yt=0Xi-i+e t , 

where (3 is an unknown constant, St's are (unobservable) independent random dis- 
turbances with zero means and a common variance a 2 , and Xt is an unit root process 
satisfying 

(1.2) Xt=X t -l+T)t, 

with x Q = 0, r) t = Yfj^o^t-j, J2f=o M < °°> Sjlo c i ^ °> and w * being in- 
dependent random noises with zero means and a common variance a 2 . We also 
assume that e t is independent of {cjj, j < t — 1}. Note that if f3 = 1, Co = 1, Cj = 
if j > 0, and St = u>t, then (1.1) becomes the well-known random walk model (see, 
for instance, Chan and Wei Q). Having observed (yi+i,Xi), i = 1, . . . , n — 1, (3 can 
be estimated by least squares 



(1.3) P„ 



En — 1 
i=l 



XiVi+l 



En — J 
i=l 



1 2 

xi 



If x n also becomes available, then it is natural to predict y„+i using the least 
squares predictor, 

(1-4) y„+i=x n /3 n . 
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To assess the performances of the least squares predictor, we consider the final 
prediction error (FPE, Akaike [1|) 

(1.5) E { ( y n+1 - y n+1 f } = a 2 + E { x 2 n ( & - f } , 
and the accumulated prediction error (APE, Rissanen flij ]) 

n n 2 

£ ~ &) 2 = £ { £i ~ - /?)} 

i=2 i=2 

n n 

(1.6) = £ £ ? + ^_ 1 (/3 i _ 1 -/3) 2 (l + (l)) aS-) 

i=2 i=2 

where the second equality of (1.6) is ensured by Chow It is straightforward to 
see that the terms in (1.5) and (1.6), 




(1.7) 5>?-i$-i-0) a = £ 

i=2 

and 

(1.8) nx 2 n n -0) 2 -- 

When {y t } is a random walk model mentioned above, Wei (fl5j|. Theorem 4) 
showed that the rhs of (1.7) equals 2a 2 1 \ogn + o(logn) a.s. By imposing further 
assumptions on the distribution of w t , Ing ([9(, Corollary 1) subsequently obtained 
the limiting value of the expectation on the rhs of (1.8), which is 2a 2 ,. This article 
extends these two results to models (1.1) and (1.2), which provides a deeper un- 
derstanding of the least squares predictor (estimate) in situations where Fisher's 
information, Xlj=i x j > grows at a rate much faster than n, and the innovations in 
input and output variables come from different sources. The rest of the paper is 
organized as follows. Section 2 derives the asymptotic expressions for the rhs of 
(1.7). In Section 3, sufficient conditions are given to ensure that the expectation on 
the rhs of (1.8) is bounded by some finite positive constant for all sufficiently large 
n. We then apply this moment property and the results obtained in Section 2 to 
show that 

(1.9) lim E{nx 2 n n -/3) 2 } = 2a 2 . 

n — >oo 

Some discussions related to (1.9) are given at the end of Section 3. In particular, 
it is shown that while the FPE of the least squares predictor is not affected by the 
contemporary correlation between e t and uit, the mean squared error of the least 
squares estimate does vary with this correlation. In addition, we also show that 
the squares of the normalized estimate, n(fi n — /3), and the normalized regressor, 
Xn/\/n, are not asymptotically uncorrelated. 



2. An asymptotic expression for the APE 



To prove the main result of this section, two auxiliary lemmas are required. They 
are also of independent interests. 
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Lemma 1. Assume the {wt} in Section 1 satisfy sup_ 00<t<00 _E|cj t | Q < oo for some 



> 2. Let z t = X^7=o djOJt~j, where \dj\ < Cj 1 for some C > and all j > 1. 



J 3 

Then, with 7t = cr 2 YtfJo d j> 



n 

(2.1) -]T(z 2 - 7t ) = (l)a.s. 

Proof. Straightforward calculations yield that 

t t z 2 -i 

(2.2) z 2 - 7t = ]Td 2 _,> 2 -a 2 ) + 2^ ^dt-udt-^ui,. 

1=1 ; 2= 2/i = l 

By (2.2) and changing the order of summations, 

"2 2 "I / "2 J2 \ "2 / "2 J2 \ 

E^^E E^W E E^ < 

t=m ;=i \t=ni / ;=m+i\t=; / 

Z 2 =2 Ui=l \t=m / J 

"2 f '2-1 / ™2 , i \ ~1 

i 2 =m+i Ui=i \t=/2 / J 

= (l) + (2) + (3) + (4), 

where = w 2 — cr 2 . In the following, we shall show that for some ctk > 1, there 
are Cfc > 0,^1^ > 1, and £ 2 ,fc > 1 independent of n\ and n-i such that 

"2 -, 

(2.3) E\(k)r <C fe (E ^) 6 < fc , 

i— ni 

where fc = 1, ... ,4. (2.3) and Moricz (1976) imply that for some a > 1, there are 
C* > 0,£i > 1, and £2 > 1 independent of n\ and n<i such that 

i 2 _ "2 -, 

(2.4) ^--jE^i^^E^- 

t=n\ /=r/i 

As a result, (2.1) follows from (2.4) and Kronecker's lemma. 
Let ai — min{a/2,2}. Then, 



1=1 t=m 
n 2 n 2 rii 

(OK) t 1 =n 1 t 2 =n 1 T l l 2 1=1 

/ " 2 , " 2 ~ 1 1 "2 -, x 

^ E^+E ^ E ^(^*^ ai 

\t=ni *i="i r l t 2 =ti+l r 2 / 

6,1 
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where C\^,i = 1,2,3 are some positive constant independent of n\ and n 2 , 1 < 
£1,1 < a i, £2,1 = £*i/£i,ii first inequality follows from Burkholdcr's inequality, 
second one follows from the fact that < ai/2 < 1 and changing the order of 
summations, third one is ensured by sup t E\ut t \ a < oo and \dj\ < Cj -1 , which 
implies for all n x < t\,t 2 < n 2 , YaLi Mti-;^t 2 -i| ai < CmI*i - t 2 \~ ai , for some 
Ci t 4 > 0. As a result, (2.3) holds for k = 1. The proof of (2.3) for the case of 
k = 2 is similar. The details are thus omitted. To show (2.3) for the case k — 3, let 
a 3 = a. Then, by Minkowski's inequality and using Wei (1987, Lemma 2) twice, 
one obtains 



where C^j, i =1,2 are some positive constants independent of n\ and n 2 . Observe 
that for n\ < t\ < t 2 < n 2 and any 1 < Mi < M 2 < m, X^=Mi \dt 1 -idt 2 -i\ < 
C3,3(log*2 — logti)/(t 2 — ti), where 63,3 > is independent of Mi and M 2 . Using 
this fact and changing the order of summations, it follows that the rhs of (2.6) is 
bounded by C 3A (J%* t- 2 ) a ^ 2 , where C 3A is a positive constant independent of 
ni and n 2 . Hence, (2.3) holds for k = 3. The proof of (2.3) for the case k = 4 is 
similar to that of k = 3. Therefore, we skip the details. □ 

Remark 1. If in Lemma 1 z ( = Y^jLodj^t-j with \dj\ < Cj~ x ,j > 1, then the 
same argument also yields (2.1) but with j t replaced by 7* = cr^X^Lo^j- For a 
related result, Brockwell and Davis (1987, Proposition 7.3.5), assuming that u>j's are 
i.i.d. with finite second moment and dj's satisfy Y'jLo 14? I < 00 an d Y'jLo < 00 ' 
obtained (n _1 Yt=i z t) ~ 7* = °p(X)- While the moment restriction of their result 
is slightly weaker than that of Lemma 1, the identically distributed assumption 
can be dropped in Lemma 1. In addition, the assumption on dj in Lemma 1 seems 
less stringent. More importantly Lemma 1 gives a strong law of large number for 
nT x Yt=i z t under rather mild assumptions, which is one of the key tools for our 
asymptotic analysis of APE. 

Lemma 2. Assume sup_ 00<t<00 .E|a>t| Q < 00 for some a > 2 and 



Proof. First note that x t = Yij^iVj- Define N t = #X)j=i w j: where 9 = Y^jLv c j- 
Then, 



(2.6) 




(2.7) 




j>k 



Then, 




(2.8) x t =N t - S t , 

where S t = Y?j=o fj^t-j with fj = YHj+i c i- In vicw of (2.8), 



n-1 



n-1 



n-1 



n-1 



(2.9) 
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Since \fj\ — 0(j _1 ), Lemma 1 yields 

n-1 

(2.10) S j = °( n ) a - s - 

3=1 

By the law of the iterated logarithm, 

n-1 

(2.11) N j = °( n2 lo g lo S n ) a - s - 

3 = 1 

By Lai and Wei (0, (3.23)), 

(2.12) limmf l0gl " gn y^>Qa.s. 

J = l 

Now, Lemma 2 follows directly from (2.9)-(2.12). □ 
Remark 2. By assuming 



(2-13) £j 



Cj| < oo, 



Proposition 17.3 of Hamilton (1994) gives the limiting distribution of n 2 Y^j=i x % 

which is A 2 J Q w(r) 2 dr, where A = a w Y^jLo c j an d w(r) denotes the standard Brow- 
nian motion. This result immediately implies 

(2.14) logj^x 2 ^) =21ogn + O p (l). 

Lemma 2 and (2.14) provide different estimates for the difference between 21ogn 
and log(^j=i x j)i but neither is more informative than the other. On the other 
hand, we have found that the assumption on the coefficients used in Lemma 2, (2.7), 
seems to be weaker than the one imposed by Hamilton, (2.13). This can be seen by 
observing that (2.7) is marginally satisfied by Cij~ 2 < \cj\ < C2j~ 2 , Ci > C\ > 0, 
whereas (2.13) is not. 

We are now ready to prove the main result of this section. 

Theorem 1. Assume that models (1.1), (1-2), and the assumptions of Lemma 2 
hold. Also assume that sup_ 00<t<00 -E|et| Q ° < oo for some ao > 2. Then, 

n 

(2.15) ^jr-.iOSi-i -j3) 2 = 2cr 2 logn + o(logn) a.s., 
and 

n n 

(2.16) ^(y l -y l ) 2 = ^e 2 + 2a 2 logn + (logn) a.s. 
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Proof. First note that (2.9)-(2.12) yield 



(2.17) 



lim sup 



(log log n) YT ]= i x ) 



< oo a.s. 



By Wei ([151, Lemma 2) and (2.7) 
S 



(2.18) 



E 



,1/2 



fn-\ 



where C a and C* depend only on a. (2.18) and the Borel-Cantelli lemma give 
(2.19) S» = o(n 1/2 ) a.s. 

Since the law of the iterated logarithm implies 

N n -O^loglogn) 1 / 2 ) a.s., 
this, (2.8), (2.17), and (2.19) yield 



(2.20) 



V™ x 2 



o(l) a.s. 



In view of (2.20) and Wei ([la], Theorem 3), we have 



'n-X 



(2.21) Y, x tx{fc-i-P? = °^o g \^£x] 



i=1 



vj'=l 



As a result, (2.15) follows from Lemma 2 and (2.21); and (2.16) is an immediate 
consequence of (2.15) and (1.6). □ 



3. An asymptotic expression for the FPE 

Assume that models (1.1) and (1.2) hold, E(e t LOt) = 7r is a constant independent of 
t, sup_ oo<t<oo -B|e t | Q0 < oo,a > 2, and sup_ 00<t<00 E\u t \ a < oo,a > 2. Then, by 
the functional central limit theorem, continuous mapping theorem, Ito's formula, 
and some algebraic manipulations, it can be shown that 

1 \^n— 1 



(3.1) 



^(1) (po~uj Jo w a (i)dw a (t) +o-0 j l w a (t)dw b (t) 

where denotes weak convergence, (w a (t), Wb(t)) is a standard Brownian mo- 

tion of dimension 2, p = 7r/<7 2 , and er 2 . = ct 2 — /Q 2 cr 2 . If we can further show that 
for some q > 2, 



(3.2) 



1 V™ _1 T 2 



0(1), 



(iG 
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then, in view of (3.1), (3.2), and (1.8), 

nE{x 2 n n -(3) 2 } 

wl(l) (pa^ w a (t)dw a {t) + a e Jq w a (t)dw b (t) 
E { " 72 > + 



In the rest of this section, we provide sufficient conditions to ensure (3.2). In addi- 
tion, the expectation on the rhs of (3.3) is investigated (Corollary 1). Let us start 
with a useful lemma. 

Lemma 3. Let Ft t m,a. m {-) be the distribution function of cbj^t+i-ii where 

a-m = (oi 5 ■ ■ • ! 0-m.y ■ There are some positive numbers k, l, and M such that for all 
m > l,—oo < t < oo and ||a m || 2 = X^j=i a j = 1> 

(3-4) | F ttm ^ m (x) - Ft^Jy) \< M \ x - y | K , 

as | x — y |< l. Then, for any q > 0, 

(3.5) 4(^X>i) )=<>{1). 

Proof. The proof is closely related to the one given in Ing ([§], Lemma 1), with the 
assumption there being strengthened to (3.4). First note that 

1 n ~ x 1 n_1 A A n ~ 1 a.2 

(3 6) iy i 2 > iy 3 , 2 = ly ^>ly ^ 

i— 1 i—nS i—nS i—nS 

where < S < 1, and without loss of generality, n5 is assumed to be a positive 
integer. Rearranging the series on the rhs of (3.6), one obtains 

— Ti 1 lq~l X 1 



(3-7) °- E E 



j=0 i=0 ' t0 + lq 1 ^ J 

where I > max[2/f«, 1/q, (l/q){(l/S) — 1)}] and for simplifying the discussion, Iq 
and { (1 — S)n}/(lq) are also assumed to be positive integers. By the convexity of 
function x~ q ,x > 0, 



' . n-l 



< 



(3.8) 



(i-5)<n -9 lq 



n 2 ^' " 1 lq J (1 - 8) n 

1=1 / V a ✓ \ / 

lq [ lq— 1 a; r (!_;,.. 



X 



E E 



3=0 I l= " u T l q 



nS + (1 ^ )n i + j 



In view of (3.8), if one can show that for some positive number C independent of 
j, the following inequality, 



9 \ — Q 



(3.9) E V "Ty- TIL <C<oo, 
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holds for all j = 0, 1, . . . , { (1 — S)n/(lq) } — 1 as n is large enough, then (3.5) 
follows. The rest of the proof only focuses on the case where j — 0, because the 
same argument can be easily applied to other j's. 
For i = 0, . . . , Iq — 1, define 

(3.10) Y n ,i = !n6+ {l ~^ n i | x nS , a- S) 



Iq 



(l-S)r, 

{l-6)n.^- 1/2 '±- 



(3.11) W nti = \nd+ hi i \ Yl - — - 



m=0 



where f 3 = Yfi=o c h and 

(3.12) F„,, = F n ,i - W n , 

(Note that z t = £^=o fj^t-j-) Then, 



, i=0 




*(xx«) =/ MIE^I >/U 



(3.13) 



Pr (_t-V(a«) < y n i < t -V(a«) , i = , . . . , l« - 1 ) <ft 

/.oo f /'<?-! 



F n ,i q -i, W n ,i,F nti , i = Q,,..,lq-2 \) dt, 



where A n>i = { -t"V(2«) < Y n , t < r 1 /' 2 ') }. In view of (3.10)-(3.12), for < p < 
Iq — 1, < i < p, and < j < p — 1, W n , p is independent of {F n ^,W n j). In 
addition, va,r(W ny i) > £ > 0, where i = 0, . . . , Iq — 1 and £ is a positive number 
independent of n and z. According to these facts, (3.4), and arguments similar 
to those used in (3.10) and (3.11) of Ing [9j, there exist some positive numbers 
0<C"<oo,0<s<oo, and a positive integer Nq such that for all n> No and all 
t > s, 

(3.14) E (^J[l An }j <C't-^ 1 ^ 2 . 

Since, by construction, I > 2/k, (3.13) and (3.14) guarantee that for n > No, 

\ ~ q ,oo 

E wy*A - s+c 'l r{Ki)/2dt< °°> 

which yields (3.9). □ 

Lemma 4 below shows that (3.4) is easily found in many time series applications. 

Lemma 4. If ' ui t 's are i.i.d. random variables satisfying E(uji) = Q,E(u)\) = crj > 
0, and E(\uji\ a ) < oo for some a > 2. Assume also that for some positive constant 
Mo < oo, 

roc 

(3.15) / \<p(t)\dt < Mo, 
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where tp(t) — E{e ltu}1 ) is the characteristic function of U)\. Then, for all — oo < t < 
oo, m > 1 and ||a m || = 1, there is a finite positive constant M\ such that 

(3.16) Sup /t,m,a m O) <M U 

— oo<:e<oo 

where /t. m ,a m (') is the density function o/X^Jli a j u! t+i-j- As a result, (3.4) follows. 

Proof. The proof is inspired by the ideas of Feller ([7], p. 516), which deal with the 
special case, aj = m -1 / 2 for all j = 1, . . . , m. Without loss of generality, assume 
cr 2 = 1. Denote Y = J2™=i djUt+i-j- Then, ip. Y (t) = E(e ltY ) = fTJLi ipj(djt). By 
Chow and Teicher ([6(, Theorem 8.4.1), 

a 2 t 2 

<P(a 3 t) = l-^+o(a 2 3 t 2 ), 
as a 2 t 2 — > 0. This gives for \<Xjt\ < SI, where 8% is some small positive constant, 

(3.17) \ip( aj t)\ <1-^. 

On the other hand, since (3.15) yields \(f(t)\ — ► as \t\ — > oo, by Chow and Teicher 
(@, Corollary 8.4.2), \ip(t)\ < 1 for all t ^ 0, and hence for all \t\ > (with 
defined above), 

(3.18) \tp(t)\ < 9 U 

where 9\ is some positive constant < 1. Now, by (3.17), 



(3.19) 



/oo m /> 2 r m 

niv(aj*)i^< / e^dt+ / ni^i* 
-°°i=i • y l*l<oi: J \t\>ctj=i 

/OO 9 /• 771 

-oo J|t|>^J- 



o m j = l 



where Oj is a permutation of |<ij| satisfying O m > O m _i > ••• > 0%. For i > 
S\/O m , (3.17), (3.18) and the fact that 

4(1 - 6i)a 2 5f 



imply 

a 2 t 2 a 2 8* 2 a 2 S* 2 

(3.20) ^(o^)! < max{l - -±-,9i} < max{l - X } < 1 - £- 7 1 



4<3 2 ' iJ - s 402 ' 

rn rn 

where < £ < min{l, 4(l-^)/5f }. In view of (3.20) and the fact that O) 
l-O 2 



m i 



(3.21) = i *~ — — t j \<p(t)\dt 





poo 




/ e 




— e 4 


1 - 


2 

4 °m 









< e 4 supxe 4 Mo < oo. 

X>1 
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By (3.21), (3.19), and the fact that 



^ roc m 

SUp /t,m,a m (x) < — / Yl l</KM)M<' 
-oo<a;<oo J — oOj_^ 



(3.16) follows. In addition, it is not difficult to see that (3.4) can be deduced 
from (3.16). □ 

In the following lemma, some moment bounds for (l/y/n)x n and (1/n) Y^i=i x i x 
£i + i, are obtained. 

Lemma 5. Assume models (1.1) and (1.2), with sup t E{ \st\ q ) < oo and 
sup t E{ \oJt\ q ) < oo, for some q > 2. Then, 



< oo, 
i \ 



(3.22) 


(i) 


sup£ ( 
n>l V 


1 


(3.23) 


(ii) 


supi? 
n>l V 


1 n— 1 

^E 

i=l 



.1 i c 



< OO. 



Proof. The proof of Lemma 5 is similar to that of Ing ([9j, Lemma 1). The details 
are omitted. □ 

Armed with the previous results, (3.2) is proved in the following theorem. 

Theorem 2. Assume that (1.1), (1.2), (3.4), sup t E(\ e t \ q ) < oo, and 
sup t E(\ u>t \ q ) < oo are satisfied, where q > 4. Then, (3.2) holds. If we further 
assume that E{etOJt) = n is a constant independent oft, then (3.3) follows. 

Proof. By Lemmas 3 and 5, (3.1), and an argument similar to the one used in 
Theorem 1, the claimed results can be obtained. □ 

The FPE of the least squares predictor is obtained in Corollary 1 below. 

Corollary 1. Assume that (2.7) and all assumptions of Theorem 2 hold. Then, 
(1.9) follows. 

Proof. By (2.15), (3.2), and Minkowski's inequality, 

1 " 

(3.24) lim V E{x1_x0i-i ~ P) 2 } = 2a 2 , 

?woo logn * — ' 

° i—m* 

where m* is some positive integer independent of n. Now, (1.9) is guaranteed by 
(3.3) and (3.24). □ 

Corollary 1 and Theorem 1 together indicate an interesting result that the term 
of order logn in the APE and the term of order n _1 in the FPE share the same 
constant, 2a 2 . For applications of this type of results to model selection problems, 
see ll|. Corollary 1 also shows that the FPE of the least squares predictor is not 



affected by the contemporary correlation between e t and ujf This is a somewhat 
unexpected feature because the least squares estimate itself does not possess this 
property. More specifically, by direct calculations, we have 

,„ ra a\ 1 p(Ju So w a (t)dw a (t) + a e f w a (t)dw b (t) 
3.25) n((3 n -j3)=>- , , 

A So 
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and 

1 (pcr u w a (t)dw a (t) + a e w a (t)dw b (t)^ 



(3.26) n 2 {f3 n -P) 2 



A 2 



(fo<(t)dt 



where A is denned in Remark 2. By (3.26), an argument similar to that used in the 
proof of Theorem 2, and some algebraic manipulations, 



lim n 2 E0 n -0f = E 

n — !-oo 

(3.27) 



1 [pom w a (t)dw a (t) + a e f Q w a {t)dw b (t) 



A 2 



P 2 jpf Jo w a {t)dw a {t) 



e ju _r — — +^hE 



i2 V !u W a(t)dt J <?°l \tiwi{t)dt)' 

where i 2 = A 2 erJ 2 . Ing ([§], (4.3)) showed that 



2 



(3.28) E ( Jo' Mt)dw a (ty ^ i3 3 

v Jo <m ) 

By (3.6.4) and (3.6.5) of Arato and using a numerical integration method, 

(3.29) E\— i — )=5.6. 

\siwi(t)dtj 

Consequently, (3.27)-(3.29) imply 

2 2 

(3.30) lim n 2 E0 n - /?) 2 = ^-13.3 + -^-5.6, 

n—>oo l z L a t) 

which obviously varies with the strength of dependence between e t and u>t- In 
particular, if a 2 — a 2 , then p — corr(et, u>t) and (3.30) can be rewritten as 



1 

n—*oo i 



(3.31) lim n 2 E((3 n - (3) 2 = -^[p 2 13.3 + (1 - p 2 )5.6l. 



As observed in (3.31), the larger the magnitude of the correlation between e t and 
ujt is, the larger the mean squared error of the least squares estimate is, a result 
new to the literature. 

As a final remark, we note that the square of the normalized estimate, n 2 ((3 n — 
f3) 2 , and the square of normalized regressor, x 2 /n, are not asymptotically uncorre- 
cted. To see this, observe that liirin^oo 

E(x 2 n /n) = A 2 , which together with (3.30) 

and Corollary 1, gives 



lim E [ ^ ) E ln 2 n - f3) 2 \ = 13.3p 2 cr 2 + 5.6aJ 

n— >oo V n J I J " 



5.6cr 2 + 1.1 p^i > 2cr 2 
lim El^n 2 n -Pf 



71 — >OG 



Therefore, x^/n and n 2 (f3 n — f3) 2 are (asymptotically) negatively correlated, which 
suggests that larger variation of yield a better estimation result. It is worth 
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mentioning that this special feature does not exist for the (asymptotically) sta- 
tionary regressor. For example, when xt — sxt-i + Vti with \<;\ < 1, following an 
argument used in Ing [icj |. it can be shown that 

lim E{xl)E{[yfti0 n - p)f \ = lim E {x 2 n n((3 n - /3) 2 } = a 2 . 

Therefore, the square of the normalized estimate, n(/3 n — (3) 2 , and the square of the 
(normalized) regressor, x 2 n , are asymptotically uncorrelated in this case. 

References 

[1] Akaike, H. (1969). Fitting autoregressive models for prediction. Ann. Inst. 

Statist. Math. 21 243-247. 1MR0246476I 
[2] Arato, M. (1982). Linear Stochastic Systems With Constant Coefficients: 

A Statistical Approach. Springer- Verlag, New York. MR0791212 
[3] Brockwell, P. J. and Davis, R. A. (1987). Time Series: Theory and 

Method. Springer- Verlag, New York. MR0868859 
[4] Chan, N. H. and Wei, C. Z. (1988). Limiting distribution of least squares 

estimates of unstable autoregressive processes. Ann. Statist. 16 367-401. 

IMR0924877I 

[5] Chow, Y. S. (1965). Local convergence of martingale and the law of large 

numbers. Ann. Math. Statist. 36 552-558. MR 0182040I 
[6] Chow, Y. S. and Teicher, H. (1997). Probability Theory: Independence, 

Interchangeability, Martingales, 3rd ed. Springer, New York. MR1476912 
[7] Feller, W. (1971). An Introduction to Probability Theory and Its Application, 

Vol. II. Wiley, New York. MR0270403I 
[8] Hamilton, J. D. (1994). Time Series Analysis. Princeton University Press, 

Princeton, NJ. IMR12780331 
[9] Ing, C. K. (2001). A note on mean-squared prediction errors of the least 

squares predictors in random walk models. J. Time Ser. Anal. 22 711-724. 

IMR1867391 

[10] Ing, C. K. (2003). Multistep prediction in autoregressive processes. Economet. 

Theory 19 254-279. MR1966030] 
[11] Ing, C. K. (2004). Selecting optimal multistep predictors for autoregressive 

processes of unknown order. Ann. Statist. 32 693-722. MR2060174 
[12] Lai, T. L. and Wei, C. Z. (1982). Least squares estimates in stochastic 

regression models with application to identification and control systems. Ann. 

Statist. 10 154-166. MR 0642726I 
[13] Moricz, F. (1976). Moment inequalities and the strong laws of large numbers. 

Z. Wahrscheinlichkeitstheorie verw. Gebiete 35 299-314. IMR0407950I 
[14] Rissanen, J. (1986). Order estimation by accumulated prediction errors. In 

Essays in Time Series and Applied Processes, Special vol. 23A of J. Appl. 

Prob. 55-61. I MR0803162I 
[15] Wei, C. Z. (1987). Adaptive prediction by least squares predictors in stochas- 
tic regression models with application to time series. Ann. Statist. 15 1667- 

1682. MR0913581 



